Noise-robust ASR by Using Disti Approximated with Logarithmic No
Abstract
Various noise-robustness approaches have been investigated with the aim of using automatic speech recognition (ASR) systems in practical environments. We previously proposed a distinctive phonetic feature (DPF) parameter set for a noise-robust ASR system, which reduced the effect of high-level additive noise [1]. This paper describes an attempt to replace the normal distributions (NDs) of DPFs in HMMs with logarithmic normal distributions (LNDs), because DPF distributions are skewed, showing both positive and negative skewness. The HMM with LNDs was first compared with a standard HMM with NDs in an isolated spoken-word recognition task using clean speech. Noise robustness was then tested with four types of additive noise. With DPFs as the input feature vector set, the proposed HMM with LNDs outperformed the standard HMM with NDs in the isolated spoken-word recognition task, both with clean speech and with speech contaminated by additive noise. Furthermore, combining the DPFs with static MFCCs and ∆P gave significant improvements over a baseline system using MFCCs and a dynamic feature set.
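To illustrate why a log-normal density can suit skewed features better than a normal density, the following is a minimal single-dimension sketch, not the paper's HMM implementation. The function names, the synthetic `samples`, and the maximum-likelihood fitting step are all assumptions made for illustration. The log-normal form shown here covers only positive, positively skewed values; the abstract does not state how negative skewness is handled, so that case is left out.

```python
import math

def normal_logpdf(x, mean, std):
    """Log-density of a normal distribution N(mean, std^2) at x."""
    return (-math.log(std) - 0.5 * math.log(2 * math.pi)
            - (x - mean) ** 2 / (2 * std ** 2))

def lognormal_logpdf(x, mu, sigma):
    """Log-density of a log-normal distribution at x > 0.

    If log(x) ~ N(mu, sigma^2), the density of x gains a 1/x factor,
    which becomes -log(x) in the log domain.
    """
    log_x = math.log(x)
    return -log_x + normal_logpdf(log_x, mu, sigma)

def fit_mean_std(values):
    """Maximum-likelihood mean and standard deviation of a sample."""
    n = len(values)
    mean = sum(values) / n
    var = sum((v - mean) ** 2 for v in values) / n
    return mean, math.sqrt(var)

# Hypothetical positively skewed feature values (not data from the paper).
samples = [math.exp(0.1 * k) for k in range(-15, 16)]

# Fit a normal to the raw values and a log-normal via the log-values.
mean, std = fit_mean_std(samples)
mu, sigma = fit_mean_std([math.log(x) for x in samples])

ll_normal = sum(normal_logpdf(x, mean, std) for x in samples)
ll_lognormal = sum(lognormal_logpdf(x, mu, sigma) for x in samples)

print("normal log-likelihood:    ", ll_normal)
print("log-normal log-likelihood:", ll_lognormal)
```

On this synthetic sample the log-normal fit attains the higher total log-likelihood. In an HMM, a comparison of this kind would apply to each state's output density rather than to one pooled sample.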
Similar resources
Normalized Autocorrelation based Features for Robust Speech Recognition in Context with Noisy Environment
This paper presents a robust approach for an automatic speech recognition (ASR) system when both additive and convolutional noises corrupt the speech signal. Robust features are derived by assuming that the corrupting noise is stationary and the channel effect is fixed during the utterance. In the proposed method the effects of additive and convolutional distortions are minimized by two stage fi...
Is speech enhancement pre-processing still relevant when using deep neural networks for acoustic modeling?
Using deep neural networks (DNNs) for automatic speech recognition (ASR) has recently attracted much attention due to the large performance improvement they provide for a variety of tasks. DNNs are known to be robust to overfitting and to be able to remove speaker variability. Another important cause of variability in speech is the presence of noise. A lot of research has been undertaken on noi...
A novel framework for noise robust ASR using cochlear implant-like spectrally reduced speech
We propose a novel framework for noise robust automatic speech recognition (ASR) based on cochlear implant-like spectrally reduced speech (SRS). Two experimental protocols (EPs) are proposed in order to clarify the advantage of using SRS for noise robust ASR. These two EPs assess the SRS in both the training and testing environments. Speech enhancement was used in one of two EPs to improve the ...
Optimizing the Implementation of MMSE Enhancement for Robust Speech Recognition
In this paper several methods are proposed to optimize the implementation of the minimum mean-square error (MMSE) estimation algorithm for robust automatic speech recognition (ASR). In the calculation of the MMSE enhancement algorithm, the original confluent hypergeometric function is approximated by a piecewise linear function, which greatly reduces the computation load while keeping the same performan...
From Multi-Band Full Combination to Multi-Stream Full Combination Processing in Robust ASR
The multi-band processing paradigm for noise robust ASR was originally motivated by the observation that human recognition appears to be based on independent processing of separate frequency sub-bands, and also by “missing data” results which have shown that ASR can be made significantly more robust to band-limited noise if noisy sub-bands can be detected and then ignored. Of the different mult...
Publication date: 2003